Collaborative Intelligent Reflecting Surface Networks With Multi-Agent Reinforcement Learning

نویسندگان

چکیده

Intelligent reflecting surface (IRS) is envisioned to be widely applied in future wireless networks. In this paper, we investigate a multi-user communication system assisted by cooperative IRS devices with the capability of energy harvesting. Aiming maximize long-term average achievable rate, an optimization problem formulated jointly designing transmit beamforming at base station (BS) and discrete phase shift IRSs, constraints on power, user data rate requirement buffer size. Considering time-varying channels stochastic arrivals harvested first formulate as Markov decision process (MDP) then develop novel multi-agent Q-mix (MAQ) framework two layers decouple parameters. The higher layer for optimizing resolutions, lower one power allocation. Since integer programming large-scale action space, improve MAQ incorporating Wolpertinger method, namely, MAQ-WP algorithm achieve sub-optimality reduced dimensions space. addition, still high complexity good performance, propose policy gradient-based algorithm, MAQ-PG, mapping actions into continuous space cost slight performance loss. Simulation results demonstrate that proposed MAQ-PG algorithms can converge faster improvements 10.7% 8.8% over conventional DDPG, respectively.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Agent Reinforcement Learning

This thesis presents a novel approach to provide adaptive mechanisms to detect and categorise Flooding-Base DoS (FBDoS) and Flooding-Base DDoS (FBDDoS) attacks. These attacks are generally based on a flood of packets with the intention of overfilling key resources of the target, and today the attacks have the capability to disrupt networks of almost any size. To address this problem we propose ...

متن کامل

Multi-Agent Reinforcement Learning with Vicarious Rewards

Reinforcement learning is the problem faced by an agent that must learn behaviour through trial-and-error interactions with a dynamic environment. In a multi-agent setting, the problem is often further complicated by the need to take into account the behaviour of other agents in order to learn to perform effectively. Issues of coordination and cooperation must be addressed; in general, it is no...

متن کامل

Multi-Agent Deep Reinforcement Learning

This work introduces a novel approach for solving reinforcement learning problems in multi-agent settings. We propose a state reformulation of multi-agent problems in R that allows the system state to be represented in an image-like fashion. We then apply deep reinforcement learning techniques with a convolution neural network as the Q-value function approximator to learn distributed multi-agen...

متن کامل

Multi-agent Relational Reinforcement Learning

In this paper we study Relational Reinforcement Learning in a multi-agent setting. There is growing evidence in the Reinforcement Learning research community that a relational representation of the state space has many benefits over a propositional one. Complex tasks as planning or information retrieval on the web can be represented more naturally in relational form. Yet, this relational struct...

متن کامل

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Journal of Selected Topics in Signal Processing

سال: 2022

ISSN: ['1941-0484', '1932-4553']

DOI: https://doi.org/10.1109/jstsp.2022.3162109